On Reconfiguration Latency in Fault-Tolerant Systems
Authors
Abstract
Digital computers embedded in critical applications such as flight controls should be equipped with appropriate fault-tolerance schemes to ensure their reliable and safe operation in the presence of component failures. System reconfiguration, which enhances reliability by dynamically using spatial redundancy, is generally the most time-consuming fault/error-handling stage. The reconfiguration latency, defined as the time taken to reconfigure a system upon failure detection or mode change, depends on many parameters, including the size of application programs and data, the CPU and memory speed, built-in testing capabilities, the type (cold, warm, or hot) of spares used, the system architecture, and the reconfiguration strategy. In this paper, we classify reconfiguration techniques into four types: reconfigurable duplication, reconfigurable N-Modular Redundancy (NMR), backup sparing, and graceful degradation. For each type of reconfiguration, we (i) evaluate the reconfiguration latency using several parameters that account for the aforementioned factors, and (ii) determine whether this type of reconfiguration can meet the application-required latency.
Index Terms: Reconfiguration latency, dynamic redundancy, processor and task parameters, backup sparing, graceful degradation, cold, warm, and hot spares.
The work reported was supported in part by a Texas Instruments Grant, by the Office of Naval Research under Grant N00014-91-J-1115, and by NASA under Grant NAG-1-1120. Any opinions, findings, and conclusions or recommendations expressed in this paper are those of the authors and do not necessarily reflect the views of the funding agencies.
0-7803-24734/95/$4.00 © 1995 IEEE
Related resources
A New Design of Fault Tolerant Comparator
In this paper we present a new design of a fault tolerant comparator with a fault-free hot spare. The aim of this design is to achieve a low time and area overhead in fault tolerant comparators. We use the hot standby technique to keep the system operating normally without interruption, and a dynamic recovery method for fault detection and correction. The circuit is divided into smaller modules...
A fault tolerant NoC architecture using quad-spare mesh topology and dynamic reconfiguration
Network-on-Chip (NoC) is widely used as a communication scheme in modern many-core systems. To guarantee the reliability of communication, effective fault tolerant techniques are critical for an NoC. In this paper, a novel fault tolerant architecture employing redundant routers is proposed to maintain the functionality of a network in the presence of failures. This architecture consists of a me...
Towards a Strongly Fault Tolerant VLSI Processor Array
In this paper a novel methodology to achieve fault tolerance in VLSI Array Processors is proposed. A “Fence” based approach is adopted in which the logic array is partitioned and spares are distributed along the boundary of the active array. The emulator as in conventional fault tolerance techniques takes care of fault mapping and reconfiguration. The latency, reconfiguration interconnect lengt...
Control Reconfiguration: Survey of Methods and Open Problems
This report surveys recent results and approaches from the field of control reconfiguration within a general fault-tolerant control framework. Open problems for future research are identified. Control reconfiguration is embedded within a wider fault-tolerant control context and set apart from other approaches to achieve fault tolerance. It is explained why simpler approaches than control reconf...
Fault-Tolerant Systolic Array Design With Partially-Reconfigurable FPGAs
Partial dynamic reconfiguration, PDR, is an important feature of modern reconfigurable architectures such as the Xilinx Virtex FPGA devices. In this paper we focus on the application of PDR to fault-tolerant systolic arrays in one dimension. Single as well as multiple faults in one or more array processing elements, PEs, are considered. In our approach modular redundancy is not used and the goa...